The Phonemic Approach for Sanskrit Text
Abstract
Sanskrit grammar distinguishes the terms varna (phoneme) and akshara (syllable). Both terms are used in the context of spoken language and can be extended to written language. Because the oral tradition in India was highly developed, the greatest stress was laid on correct pronunciation in the spoken language. To represent such nuances of speech in writing, various chinhas (signs) evolved so as to establish an equivalence between spoken and written expression. This extraordinary activity is part of the Indian tradition. Realizing such a phonemic system in the context of new technology therefore seems imperative, with writing discussed in the context of speech and speech in the context of writing. An attempt is made to identify a varnamala comprising the basic speech sound units, namely vowel phonemes (swara varna) and consonant phonemes (vyanjan varna) (Diagram 1). These phonemes (varnas), when combined as C...C + V or as V alone, form a complete phonetic cluster. The correspondence between spoken and written syllables must be preserved through the phonemic scheme, first by giving each phoneme a distinct identity and second by giving each chinha, which denotes a nuance of speech, a distinct identity through standardisation.
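The C...C + V or V grouping described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' scheme: the vowel inventory below uses IAST romanization as an assumption, the function name `group_clusters` is hypothetical, the input is assumed to be already split into individual phonemes, and chinhas such as anusvara and visarga are not handled.

```python
# Assumed vowel (swara) inventory in IAST romanization; not taken from the source.
VOWELS = {"a", "ā", "i", "ī", "u", "ū", "ṛ", "ṝ", "ḷ", "e", "ai", "o", "au"}


def group_clusters(phonemes):
    """Group a phoneme sequence into clusters of the form C...C + V or V.

    Every phoneme not listed in VOWELS is treated as a consonant (vyanjan).
    Consonants are collected until a vowel closes the cluster.
    """
    clusters = []
    pending = []  # consonants still waiting for a vowel
    for phoneme in phonemes:
        pending.append(phoneme)
        if phoneme in VOWELS:
            clusters.append(pending)
            pending = []
    if pending:
        # Trailing consonants with no following vowel form an incomplete cluster.
        clusters.append(pending)
    return clusters


if __name__ == "__main__":
    print(group_clusters(["dh", "a", "r", "m", "a"]))  # the word "dharma"
    print(group_clusters(["i", "n", "d", "r", "a"]))   # the word "indra"
```

Under these assumptions, "dharma" groups as dh+a and r+m+a, and "indra" as i and n+d+r+a. A final consonant without a vowel, as in "vāk", is returned as a separate incomplete cluster rather than being dropped.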
Similar resources
Automatic Sanskrit Segmentizer Using Finite State Transducers
In this paper, we propose a novel method for automatic segmentation of a Sanskrit string into different words. The input for our segmentizer is a Sanskrit string either encoded as a Unicode string or as a Roman transliterated string and the output is a set of possible splits with weights associated with each of them. We followed two different approaches to segment a Sanskrit text using sandhi ...
English Speech to Sanskrit Speech (ESSS) using Rule based Translation
Developing a system that automatically recognizes input speech and translates it into another language such as Sanskrit is a challenging task. Sanskrit is a much more conjured language. The purpose of this paper is to explain a system which converts English speech into English text, then translates that English text into Sanskrit text, and finally converts that text into speech. This ...
A Sandhi Splitter for Malayalam
Sandhi splitting is the primary task for computational processing of text in Sanskrit and Dravidian languages. In these languages, words can join together with morpho-phonemic changes at the point of joining. This phenomenon is known as sandhi. A sandhi splitter splits a string of conjoined words into individual words. Accurate execution of sandhi splitting is crucial for text processing tasks ...
Building a Prototype Text to Speech for Sanskrit
This paper describes the work done in building a prototype text-to-speech system for Sanskrit. A basic prototype text-to-speech system is built using a simplified Sanskrit phone set and a unit selection technique, in which prerecorded sub-word units are concatenated to synthesize a sentence. We also discuss the issues involved in building a full-fledged text-to-speech system for Sanskrit.
Stop-Word Removal Algorithm and its Implementation for Sanskrit Language
In the information era, optimizing processes for information retrieval, text summarization, and text and data analytics systems is of utmost importance. To achieve accuracy, redundant words with little or no semantic meaning must therefore be filtered out. Such words are known as stopwords. Stopword lists have been developed for languages like English, Chinese, Arabic, Hindi...